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ABSTRACT 



Because correct configuration data is essential to the opera- 
tion of any RAID system, and because multiple copies of the 
configuration data are kept, not only in the RAID controller 
itself, but also in each disk drive unit in the configuration, it 
is imperative that the various copies of the configuration 
data do not become "out of synchronization", which means 
that one or more copies of the configuration data arc 
different from one or more other copies of the configuration 
data. To maintain synchronization of all copies of the 
configuration data, the current invention compares the con- 
figuration data stored in the RAID controller's NVRAM to 
that of the current system, and records any new, non- 
responding, repositioned or unidentified storage devices in a 
change list. The identities and the attachment points of any 
storage devices in the change list are then displayed to the 
user, and various options for correcting the lack of synchro- 
nization are provided by enabling one or more function keys 
that may be selected by the user. One such function key, an 
"accept change*' key, causes the configuration data to be 
modified by changing the current operational state of any 
non-responding storage device. A "configuration adjust- 
ment^' key, when selected by the user, causes the configu- 
ration data to be modified to reflect any changes in the 
attachments points of any repositioned storage devices. An 
"import configuration" key causes the configuration data to 
be changed to correspond to that of an unidentified storage 
device. And a "retry" key gives the user an opportunity to 
rearrange the system and then re-compares the configuration 
data in the controller to the current system configuration. 

5 Claims, 5 Drawing Sheets 
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RAID ARRAY CONFIGURATION 
SYNCHRONIZATION AT POWER ON 

CROSS REFERENCE TO RELATED 
APPLICATIONS 

This application is a continuation-in-part of co-pending 
application Ser. No. 08/864,530 filed May 28, 1997, which 
Ls entitled "Run Time Validation Of SCSI RAID Array, 
Configuration". 

BACKGROUND OF THE INVENTION 

This invention pertains to conaputers and other informa- 
tion processing systems and, more particularly, to an infor- 
mation processing system that includes a redundant array of 
inexpensive disks ("RAID") with a means for synchronizing 
the disk configuration data. 

As the performance of microprocessor and semiconductor 
memory technology increases, there is a need for improved 
magnetic disk storage technology with comparable perfor- 
mance enhancements. Unfortunately, the performance of 
newer processor and semiconductor memory technology has 
out paced that of magnetic disk storage technology. In 1988, 
however, a paper published by Patterson, Gibson and Katz 
("A Case For Redundant Arrays Of Inexpensive Disks 
(RAID)"; International Conference On Management Of 
Data; pp. 109-116; June 1988) laid the foundation for the 
use of a redundant array of inexpensive disks ("RAID") that 
would not only significantly improve the data transfer rate 
and data I/O rate over a comparable single disk access, but 
would also provide error correcting redundancy and lower 
cost. In their paper, Patterson, Gibson and Katz described 
five "levels" of RAID systems, which are briefly described 
below (RAID-1 through RAID-5). Since that time, other 
RAID levels have been described, two of which are also 
briefly described below (RAID-0 and RAID-6). 

Most RAID systems incorporate both redundancy and 
some form of data interleaving, which distributes the data 
over all the data disks in the array. Redundancy is usually in 
the form of an error correcting code, with simple parity 
schemes predominating. However, RAID-1 uses a "mirror- 
ing" redundancy scheme in which duplicate copies of the 
same data are stored on two separate disks in the array. 
Parity and other error correcting codes are either stored on 
one or more disks dedicated for that purpose only, or they 
may be distributed over all the disks in the array. Data 
interleaving is usually in the form of data "striping" in which 
the data to be stored is broken down into blocks called 
"stripe units", which are then distributed across the data 
disks. A typical size of a stripe unit is 8K to 64K Bytes. A 
"stripe" is a group of corresponding stripe units, one stripe 
unit from each disk in the array. Thus, the "stripe size" is 
equal to the size of a stripe unit times the number of data 
disks in the array. Data interleaving may also be accom- 
plished on a bit-by-bit basis, such as is described in more 
detail below with regards to RAID -3. Six RAID levels wiU 
now be described. 

RAID-0 utilizes data striping, but does not use redun- 
dancy. RAID-0 has a lower cost than any other RAID level, 
and its write performance is the best because there is no 
writing of redundant information. The primary disadvantage 
of RAID-0 is its lack of redundancy. Consequently, any 
single disk failure in the array results in lost data. 

RAID-1 uses mirroring in which identical data is stored 
on two disks. An advantage of RAID-1 is that it is simple to 
implement in software. RAID-1 is also error correcting 
because complete recover is possible from the failure of any 
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one disk drive by simply switching to the drive that contains 
the duplicate copy of the data. After replacing the defective 
drive, the data on the duplicate drive can be recopied to the 
replacement drive. When servicing two or more requests to 

5 read data that is stored on the same disk, RAlD-1 has a faster 
read rate than RAID-0 because one request can be serviced 
from the first disk, and the second request can be simulta- 
neously serviced by the duplicate disk. A disadvantage of 
RAID-1 is that it is expensive because it requires two times 

10 the number of drives necessary to stored the same data. 
Thus, its efficiency is always V^. TTie necessity of making 
duplicate copies of all data also makes this RAID level 
somewhat slow to write data. 

RAID-2 uses error correcting codes such as those found 

15 in error correcting semiconductor memory systems. 

RAID-3 uses a separate parity disk to store error correct- 
ing parity information and a plurality of data disks that 
contain bit interleaved data information. Unlike semicon- 
ductor memories, a faulty disk drive is usually easily iden- 
tified because disk drives and their associated controllers 
typically contain sophisticated error detecting mechanisms 
that can quickly identify a failed drive. Consequently, if a 
single data drive has failed, the contents of the failed drive 
can be easfly reconstructed using the information from the 
"good" data drives plus the parity drive. Conceptually, the 
reconstruction of a specific bit of a failed drive could be 
accomplished by calculating the parity of the corresponding 
bit of each of the "good" drives and then comparing it to the 
corresponding bit of the parity drive. For example, if the 
parity of the first bit of each of the "good" drives is a logical 
0, and the first bit of the parity drive is a logical 1, then the 
first bit of the failed drive must have been a logical I 
(because the parity of the first bit of all the data drives must 
equal logical 1 , in this example). Mathematically speaking, 
the data on the failed disk can be calculated by starting with 
the parity information from the parity drive and subtracting, 
modulo two, the corresponding information on the "good" 
data drives. If, on the other hand, the parity drive fails, parity 
is easily reconstructed from all the data drives. 

40 . 

For this RAID level, data is bit interleaved on the data 
disks. For example, a basic RAID-3 system in which data is 
organized in 8 bit bytes and having 8 data disks and one 
parity disk would store the first bit of every byte on the first 
disk, the second bit of every byte on the second disk, on so 
on. Thus, a write request simultaneously accesses all 8 data 
disks plus the parity disk, while a read request accesses all 
8 data disks. Consequently, the data rate, which is the rate at 
which data can be written to or read from sequential loca- 
tions on the disk without head repositioning, is very high for 
RAID-3. A primary disadvantage of this RAID level is that 
it only permits one request to be serviced at any one time. 
RAID-3 systems also have relatively low I/O rates, which is 
the rate at which data can be written to random locations on 
the disk, thereby requiring frequent head repositioning. 

RAID4 also uses a separate parity disk to store error 
correcting parity information and a plurality of data disks 
that contain interleaved data information. Unlike RAID-3, in 
which data is bit interleaved across the data disks, RAID4 
60 uses block interleaving or data striping, which is described 
in more detail above. 

The performance of RAID4 is particularly dependent on 
the type of access requested, read or write, and the size of the 
requested access relative to the size of the stripe unit and the 
65 size of the stripe. A request to read a block of data that is 
contained entirely within one stripe unit can be quickly 
serviced as soon as the disk drive containing the requested 
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data becomes available. Consequently, multiple requests to of configuration data or inaccuracies in configuration data 

read various blocks of data, each of which is entirely can be catastrophic. Therefore, to prevent the loss of accu- 

contained within one stripe unit on a different data drive, can rate configuration data, multiple copies of the configuration 

be serviced simultaneously. In contrast, a RAID-3 system data arc usually kept in various locations throughout the 
must service multiple requests serially, and if head reposi- 5 RAID system. In the RAID adapter, one copy of the con- 

tioning is required between the servicing of each request, the figuration data is kept in a non-volatile memory (NVRAM) 

performance of a RAID-3 system will be dramatically and another copy is kept in a flash memory, while additional 

slower than a RAID -4 system for this type of access. A read copies are kept in each of the disk drives in the array. Each 

operation of stripe size data blocks can also be very fast in copy of the configuration data stored in the disk drives are 
RAID4, particularly if scheduling permits all data disks to be lo usually stored at a predetermined location, for example, the 

accessed at one time. last 16 sectors of each disk. In any one properly configured 

A request to write data to a single stripe unit can be a system, these various copies of the configuration data must 

relatively slow process, because it requires four disk t>® identical. 

accesses. Specifically, a data write to a single stripe unit There are a number of reason that might cause the various 
requires that the old data and corresponding parity informa- 15 copies of the configuration data of a system to become "out 
tion be read from the appropriate data disk and the parity of synchronization^', which means that one or more copies of 
disk. Next, new parity information is computed using the old the configuration data are different from one or more other 
data, the new data and the old parity. Finally, the new data copies of the configuration data in the system. For example, 
and the new parity are written to the data and parity disks, the RAID adapter itself may have been replaced due to its 
respectively. Requests for multiple writes to various stripe failure or for some other reason. It is even possible to 
units located on different drives and in different stripes is remove and replace the NVRAM or its battery power source 
even more problematic, because each write requires a read from the adapter, thereby causing the loss of all configura- 
and write operation to the parity disk and, since there is only tion data in the NVRAM. In addition, one or more disk 
one parity disk, it can become "bottle necked." Writing an drives may have been added to the system, removed from 
entire stripe of data is much easier because no read opera- 25 the system or replaced with different drives, 
tions are required and the parity for the new stripe of data is while many RAID system include "hot swapping^' 
easily computed. capability, which permits disk drives to be replaced with 
RAID-5 is similar to RAID-4 in that it interleaves data by power on, changes to the physical configuration of disk 
stripe units across the various disk drives, and also stores drives in the array may also occur when the power is off. 
error correcting parity information. In RAID-5, however, One particular application for such changes in the physical 
there is no dedicated parity disk as there is in RAID-3 and configuration of disk drives is found in high security systems 
RAID4. Instead, RAID-5 distributes parity across all the wherein disk drive units are removed from the host corn- 
disk drives, thereby eliminating the parity disk bottleneck puter system at night and locked up. In the morning, these 
problem described above with regards to certain write disk drives are returned to the host system and may, or may 
operations of RAID4 systems. Furthermore, because not, be plugged back into the same physical "slot" or "bay** 
RAID-5 distributes data over all the disks, and RAID4 only in the host system that they were installed in the previous 
distributes data over the data disks (which is equal to the day. Other improper physical configuration errors may also 
total number of disks minus the total number of parity occur, such as the insertion of a "foreign" disk drive into the 
disks), RAID-5 has a slight performance advantage over host system, or the failure to replace one of the disk drives, 
RAID4. With these performance enhancements, RAlD-5 is thereby leaving an empty slot. 

usually preferred over RAID4 and, consequently, most Accordingly, the invention described below checks for 

RAID4 systems have disappeared from the market to be proper configuration synchronization at power on, warns the 

replaced by RAID-5 systems. user of such configuration errors, and provides numerous 

RAID-6 is similar to RAID-5 in that it interleaves data in options for the user to correct the errors, 
stripe units and distributes parity information across all the 

disk drives. A disadvantage of RAID-5, as well as RAID-3 SUMMARY OF THE INVENTION 

and RA1D4, is its inability to correct a failure in more than g^.^^ .^^^^^^^^ ^ ^ ^.^^ 

one single disk^ As the size of disk arrays increases, removable and identifiable storage devices that can be 

however, the probability of a failure m more than one drive ^^^^^^^^ ^ ^ ^^^^^ ^^^^^^^ 

also mcreases which, m turn, increases the chance of an u„ * • + ti, n Atn * • i j c 

, , ^ ' * , , - , attachment pomts. The RAID system includes means for 

unrecoverable failure. To overcome the problem of a two , • ^ * *- j * \ • *u c . 

T^^T,^^ ^ . «^ stonng first configuration data, wherein the first confieura- 

dKk failure, RAID-6 uses Reed-Solomon codes in a P+Q Uon data is indicative of a first system configuration in which 

redundancy scheine that can recover from a failure of any ^ j^^^^j removable storage devices are configured 
two disks. One disadvantage of RAID-6 oyer RAID-5 „ for attachment to the RAID system. The first configuration 

however is in the write performance of small amounts of j^^^ -^^^^^^^ information regarding the identity, attachment 

data on the order or one Stripe unit in size. Recall that writes „ i r»„.^ ~p ^ « ' r »l « » 

„ „ ^ . .r.ATT>^. r point and the operational state or each one oi the rirst 

of small amounts of data arc slow m RAID-5 because four „f ^ ^^^-^^^ .^^^^^^^ ^ ^ ^^^^^ 

data accesses are requiredj^or such small wnle operations comparing the first configuration data stored in the RAID 

RAID-6 is eve.1 more ^efficient because it requires a total ^ ^ ^^^^ ^ configuration and for identifying 

of SIX accesses to update both the "P and "Q information. ^.^^g^ devices that are a part of the second system 

Configuration Changes Prior To Power On configuralion but are not a part of the first system 

configuration, for identifymg any non-respondmg storage 

A RAID system includes configuration data, which devices that are a part of the first system configuration but 
includes operating parameter information for each disk drive 65 are not responding in the second system configuration, for 

in the array. Correct configuration data is essential to the identifying any repositioned storage devices that were 

operation of any array of independent disk drives, and loss located at a one attachment point in the first system con- 
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figuration and were repositioned to a different attachment the first configuration data is indicative of a first system 
point in the second system configuration, and for identifying configuration in which a first plurality of removable storage 
any unidentified storage devices that are a part of the second devices are configured for attachment to said RAID system, 
system configuration but are not a part of the first system The first configuration data includes information regarding 
configuration. The means for comparing also records the s the identity, attachment point and the operational state of 
identities and attachment points of any such new, non- each one of the first plurality of storage devices. The system 
responding, repositioned, and unidentified storage devices in also includes a means for comparing the first configuration 
a change list. Meas for displaying the identities and attach- data stored in the RAID system to a second system con- 
ment points of any storage devices that are listed in the figuration and for identifying any non-responding storage 
change list are also provided. The RAID system also devices that are a part of the first system configuration but 
includes means for enabling an "accept change" key in are not responding in the second system configuration. The 
response to the listing of a non-responding storage device in means for comparing also includes means for recording the 
the change list, the accept change key, when activated by a identities and attachment points of any such non-responding 
user, causes the first configuration data to be modified by storage devices in a change list. The RAID system also 
changing the current state of any non-responding drive to a 15 includes means for displaying the identities and attachment 
new state. A means for enabling a "configuration adjust- points of any storage devices that are listed in the change list, 
ment" key in response to the listing of a repositioned storage Furthermore, the RAID system also includes means for 
device in the change list is also included. The configuration enabling an "accept change" key in response to the listing of 
adjustment key, when activated by a user, causes the first a nonresponding storage device in the change list, the accept 
configuration data to be modified to reflect any changes in 20 change key, when activated by a user, causes the first 
the attachment points of any repositioned storage devices. configuration data to be modified by changing the current 
The system further includes means for enabling an "import state of any non- responding drive to a new state, 
configuration" key in response to the listing of an uniden- 
tified storage device in the change list. The import configu- BRIEF DESCRIPTION OF THE DRAWINGS 
ration key, when activated by a user, causes the first con- 25 

figuration data to be modified to correspond to the system . ^ ^ ^ ^^^^^ diagram of a RAID system of the present 

configuration of an unidentified storage device. In addition, mvention. 

a means for enabling a "retry" key in response to the listing FIG. 2 is a flow diagram of the process that is executed at 

of a storage device in the change list is also included. The power on and that produces the various change lists, 

retry key, when activated by a user, causes the means for 30 FIG. 3 is a flow diagram of the process that displays drive 

comparing to re-corapare the first configuration data stored identification data for drives that are on a change lists, and 

in the RAID system to the second system configuration, enables various options for user responses to the various 

thereby giving the user an opportunity to reanrange the changes. 

configuration of the second system configuration to corre- pj^. 4 is a diagram of an algorithm for logical drive 

spond to the first system configuration. 35 migration 

In another embodiment the invention is a RAID system ^able 1 is a code table that lists the configuration data that 

for use with removable and identifiable storage devices that ^ ^^^^^ ;„ ^^^^ ^^^^ ^^.^^ ^^^^^j^^^ 

can be attached to the R^D system at a plurality of diflerent ^j^^^^ controller's NVRAM 

attachment pomts. The RAID system includes means for /„. r,^^, 

- ^ ^ . , . . . ^ and rlasn memory. 

storing first configuration data, wherein the first configura- 40 ^ . , ^ . , 

tion data Ls indicative of a first system configuration in which , ™^ ^ ^ ^^^.^ ^^^^f [^e system management 

a first plurality of removable storage devices are configured ^^^^ ^^^^ ^^^^^^ ^^^^ °f ^'^^ ^^^^^ ^"^^ ^^^^^^^^ 

for attachment to the RAID system. The first configuration 'y^^^' Z u '° controller's 

data includes informaUon regarding the identity, attachment NVRAM and flash memory. 

point and the operational state of each one of the first 45 DESCRIPTION OF THE ILLUSTRATIVE 

plurality of storage devices. The system also includes means EMBODIMENTS 

for comparing the first configuration data stored in the RAID 

system to a second system configuration and for identifying Referring to FIG. 1, the RAID system 100 of the preferred 

any unidentified storage devices that are a part of the second embodiment includes a Small Computer System Interface 

system configuration, but are not part of the first system 50 ("SCSI") bus 101, although other bus architectures may also 

configuration. The means for comparing also includes be used. A disk drive unit 102 can be connected to bus 101, 

means for recording the identities and attachment points of but is illustrated in FIG. 1 as being disconnected from the 

any such unidentified storage devices in a change list. In bus. Disk drive unit 102 includes a well known power reset 

addition, the system further includes means for displaying subsystem 103 that sends a signal out over bus 101 to 

the identities and attachment points of any storage devices 55 indicate that a power reset has just occurred. For disk drive 

that are listed in the change list. Also included in the system units that are designed to be attached to a SCSI bus, the 

is a means for enabling an "import configuration" key in signal that is sent out over the bus in response to a power 

response to the listing of an unidentified storage device in reset is well known in the art and is commonly called a "unit 

the change list, the import configuration key, when activated attention error" signal. Disk drive unit 102 also includes a 

by a user, causing the first configuration data to be modified eo well known error or defect detection subsystem 104 that can 

to correspond to the system configuration of an unidentified detect defects in the disk drive unit, 

storage device. Configuration data 105 is stored in disk drive unit 102, for 

In yet another embodiment, the invention is a RAID example, in the last 16 sectors of the disk. The configuration 

system for use with removable and identifiable storage data is illustrated in more detail in Table 1, which is a 

devices that can be attached to said RAID system at a 65 computer code listing of all the data that, collectively, makes 

plurality of different attachment points. The RAID system up the configuration data. The configuration data 105 

includes means for storing first configuration data wherein includes operating parameter data 106 that includes infor- 
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mation about the state or status of the disk drive unit, as well cation field 108 is also part of the configuration data which, 

as other operating parameters. in Table 1, is located at line 161. At the time the disk drive 

For example, the current state of a disk drive is indicated ^^^^ ^rs^ configured, the user is given an opportunity to 

by a 5 bit "state" code, which is found at line 17 of Table 1 enter a unique identification code for all disk drives. The 
and further described in the remarks that follow through line 5 user may, or may not, choose to assign such an identification 

36. There are 9 stale codes that are defined. "ONL" indicates code. 

that the drive is "on line." "RBL" indicates that information A time stamp that corresponds to the time that the 
from a failed drive is currently being "rebuilt" onto this configuration data is first saved is written into the time stamp 
drive. A rebuilding process may take place after a drive has field 109 of the configuration data. Time stamp field 109 is 
failed, and the data that was lost from the failed drive is lo provided for the purpose of uniquely identifying the con- 
reconstructed on a drive that has been temporarily marked figuration of a di^ drive in the event that the manufacturer 
RBL. "DDD" indicates a "defunct disk drive." Each of these either fails to assign the product I.D. and serial number 
three disk drive states (ONL, RBL and DDD) are referred to information, or such information is stored at a non-standard 
as a "hard configured" state, because these states are only address within the disk drive unit and, in addition, the user 
used to identify a disk drive that is part of the current system 15 also fails to assign a unique user defined I.D. in field 108. 
configuration (note, a defunct drive that is marked DDD is The update count in update count field 110 is initially set 
still considered to be part of the current system to one, and incremented each time the configuration data is 
configuration, despite that fact that it has failed). changed; no matter how small the change. Although incre- 
Drives that are in a "soft" configured state, are not part of menting the update count is preferred, any other change to 
the current system configuration and do not contain valid this field would be permissible, such as decrementing the 
data. There are five such soft configured states. In the "HS" count, as long as a unique update count number is assigned 
or "hot spare" state, the drive is spinning and immediately each time the configuration is changed, 
available for use in the event that a currently active drive A second disk drive unit 111 is illustrated in FIG. 1 as 
fails. In this case, the state of the "failed drive" is changed being connected to bus 101. Disk drive unit 111 is similar to 
from ONL to DDD. Next, the state of the "hot spare drive" disk drive unit 102, and includes power reset 112 and defect 
is changed from HSP to RBL and the state of the failed drive detection 113 subsystems similar to those of disk drive 102. 
is changed from DDD to DHS, and the process of rebuilding Configuration data 114 is stored in disk drive 111 and 
the data that was lost from the failed drive begins. DHS is includes operating parameters 115, product I.D. and serial 
the "defunct hot spare" state. Unlike DDD, DHS is a soft number field 116,user assigned identification field 117, time 
configured state and, therefore, indicates that the drive is no stamp field 118 and update count field 119 similar to those 
longer part of the system configuration. Immediately of disk drive 102. 

thereafter the state of the hot spare drive is changed from goth disk drive units 102 and 111 are designed to be "hot 

RBL to ONL and this dnve is now part of the system swapped" into and out of the RAID system, which means 

configuration. The SHS state or standby hot spare" state that they can be plugged into and unplugged fi^m bus 101, 

IS similar to the HSP state, except that the dnve is not all without turning the power off. Upon plugging one of the 

spinning in this state. j^j^. ^^-^^ ^^-^^ -^^^ 101, power is picked up by the disk 

The "RDY" state indicates that a drive is "ready", which drive unit from the bus, thereby causing a power reset of the 

means that the drive is available for use and can be moved disk drive unit which, in turn, causes a power reset signal to 

into another state, such as the SHS state. Unlike the HSP be sent from the power reset subsystem (e.g., 103 or 112) out 

state, a drive that is in the RDY state usually requires user over the bus. 

intervention to move it to another state. The "SBY" state is raID controUcr 120 includes an internal bus 122 that is 

similar to the RDY state, except that a drive in this state is coupled to SCSI bus 101 via a bus interface unit 121. A 

not spinning. The RDY state is the defauh state for new hard processor 123. RAM 124 and ROM 125 are coupled to 

disk dnves that are inserted into the system, while the SBY internal bus 122. Anon-volatile memory 126 is also coupled 

state is the default state for a new tape or CD ROM drive. imemal bus 122. Non-volatile memory 126 is preferable 

The operating parameters also include an indication of the of the type commonly referred to as a non-volatile RAM or 

physical location of the drive, such as the "channel number" "NVRAM", which is typically a low power CMOS memory 

of line 42 and the "SCSI I.D." of line 43. The channel that is powered or backed-up by a battery, such that the 
number identifies the particular bus or bank that the drive 50 information stored in the non-volatile memory will not be 

has been configured for while the SCSI I.D., which may also lost when the main power is switched off. A timer 127 is also 

be referred to as the "Target I.D.", identifies the particular coupled to internal bus 122 and is used to "time stamp" the 

slot or bay that the drive has been configured for. The configuration data of each of the disk drive units in the 

"EMP" or "empty" state is not technically a drive state, but system. It is not necessary for timer 127 to keep time 
indicates that a particular bay is empty, i.e., no drive is 55 according to some standard method, such as 24 hour time, 

installed in that bay. Instead, timer 127 may use some arbitrary time system in 

The configuration data includes product identification and which the timer is simply incremented at some predeter- 

serial number field 107, which is found in line 38 of Table mined interval, such as 1.3 msec, which is the time interval 

1. The product I.D. and serial number information is used in a preferred embodiment. 

assigned by the manufacturer of the disk drive unit and in 60 Configuration data is not only stored in NVRAM 126, but 

many, but not all cases this information is stored at a also in a well known electrically erasable programmable 

predetermined address within the disk drive unit. When a read only memory (EEPROM) 128, which is also referred to 

disk drive unit is first configured and the configuration data as a "flash" memory. The flash memory 128, however, is 

is first saved, the system takes the manufacturer's product only updated at power on, while any changes to the system 
I.D. and serial number from the predetermined location and 65 configuration during operation are stored in NVRAM 124 

copies it into the product I.D. and serial number field of the and also entered into a "device change list", which is also 

configuration data. A user assigned disk drive unit identifi- stored in NVRAM and on each disk drive. This device 
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change list is described in more detail below with regards to proper location (target I.D. and channel). If not, a "wrong 

the description of Table 2 and later, with regards to FIG. 2. I.D." list is created in step 207. And in next step 208, the 

Collectively, the elements of RAID controller 120, such as controller looks for any drives that are not part of the current 

processor 123 and timer 127, as well as programs and data configuration and that are in hard configured locations; i.e., 

loaded into memories 124-126, provide the means for 5 at target I.D/s that are marked ONLor RBLin the current 

performing the various functions described below. Although configuration. If such an unidentified drive is found, it is 

this programmable computer subsystem is the preferred entered into an "unidentified drive" list in step 209. 

implementation for carrying out the functions described After completing the POST routine of FIG. 2 in which 

below, other embodiments may also be suitable, such as the various change lists may be created, RAID controller BIOS 

use of discrete hardwired logic. will execute the routine of FIG. 3, which begins at step 301. 

Each disk drive unit in the system (e.g., 102 and 111 ) At step 302, the controller looks for a power on change list 

stores, not only its own configuration data, but also con- and, if one is found, branches to step 303. At step 303, the 

figuration data for each and every disk drive unit in the controller looks for new drives in previously empty bays. In 

system. Thus, in addition to the NVRAM and the flash ot^er words, bays that are marked "EMP" in the current 

memory, each disk drive unit has all the configuration data configuration, but now have a drive marked "RDY" or 

for the entire system. Furthermore m addiUon to the con- «SBY." If such a dive is found, its identity is displayed to the 

figuration data for each of the disk drives, other system ^^^^ specifically, the old state, new state, 

management information is also stored in each disk drive, ^^^^^^^ ^. ^^^^.^^ ^ 

typically in the last 16 sectors of the disk. This system , n u -1 • u u 

rnanagement information is illustrated in Table 2, which is a ^02 303 and 304 until all such new dnves have been 

code table that illustrates the various types of system man- 20 displayed in step 3U4. 

agcment data that is stored in each disk. however, a previously configured drive is missing from 

Disk drives are physically attached to a RAID system by system, the program branches at step 303 to step 305 

plugging them into open slots or bays, which are arranged in wherein the identity of the non-responding drive is displayed 

banks. Each bank is referred to as a "channel" and each bay to the user. More specifically, the old state, new state, 

has an associated "target I.D.'* As indicated in Table 2, 25 channel and bay are displayed, wherein the old state is the 

system management information that is indicative of the configured state, and the new state is the state that the 

particular target I.D. and channel that a drive has been controller will assign to the drive. In next step 306, the "F4" 

configured for is stored in lines 3 and 4, respectively. In line function key is enabled, which is a "retry" option. When F4 

5, the configuration data, as described in more detail in Table is enabled, it gives the user an opportunity to correct the 

1, for each of the disk drives in the system is stored. 3Q problem and then select the F4 retry key. For example, a 

Following the configuration data, a "device change list" is drive may not be responding simple because the user failed 

stored, as indicated in line 6. This device change list is to insert the drive in the system. After the error message is 

described in more detail below with regards to FIG. 2. displayed and the retry key enabled, the user may elect to 

After power up, the RAID controller 120 executes a insert the proper drive into the system and then select the 

power on self test routine ("POST'), which is stored in 35 retry key. In addition, the "F5" key is also enabled, which 

ROM 125 and which begins at step 201, as illustrated in instructs the system to assign the new state, which is 

FIG. 2. At next step 202, POST checks for a device change displayed on the screen, to the drive. The program repeats 

list in NVRAM and if one is found, branches to step 203 steps 302, 303, 305 and 306 until all non-responding drives 

wherein flash memory 128 is updated with the latest changes have been displayed. 

to the configuration data, as recorded in the device change 40 In next step 307, the controller checks for a wrong I.D. 

list. If configuration changes are made during system list, which indicates that a previously configured drive has 

operation, the configuration data in the NVRAM is updated been found, but in the wrong location. If such a drive is 

and the change in the configuration data is entered into the found, the program branches to step 308 wherein the identity 

device change list. Although configuration data is also stored of the drive is displayed. More specificafly, the old channel 

in flash memory 128, the flash memory is only updated with 45 and old bay are displayed, followed by the new channel and 

the latest configuration data after a reset of the RAID new bay location. In next step 309, the F6 key is enabled, 

controller or at power on (i.e., at step 203). Thus, the device The F6 key gives the user the option to adjust the configu- 

change list only includes the difference between the con- ration. In other words, the configuration data will be 

figuration data in the NVRAM and the configuration data in adjusted to reflect the new locations of the previously 

the flash memory. After the completion of step 203, the 50 configured drives. In the alternative, the user may elect to 

device change list is either deleted or modified in some physically relocate the drives, and then select the retry key. 

fashion to indicate that there are no current differences Steps 307, 308 and 309 are repeated until all "wrong I.D." 

between the configuration data in the NVRAM and the flash drives have been displayed. 

memory, such as by resetting all the entries in the device in next step 310, the program branches to step 311 if an 

change list to zero. 55 unidentified drive list is found. Instep 311, the idenUty of the 

In next step 204, RAID controller 120 looks for a new unidentified drive is displayed. More specifically, the host 

drive in a previously empty bay. More spedficaUy, the RAID LD., old channel and old bay are displayed, followed by the 

controller is looking for a target I.D. that is marked "EMP" new channel and new bay. The host I.D., old channel and old 

(empty) in the current configuration and that now has a drive bay are from the drive (not from the NVRAM of the current 

in it. If such a new drive is found, it is entered into a "power 60 controller 120), such that it identifies to the user the host 

on change list" at step 205. The RAID controller also looks I.D., channel and bay that the drive was previously config- 

for each drive of the current configuration. If the controller ured for. In most cases, the host I.D., old channel and old bay 

is unable to find a drive from the current configuration will identify the system that the drive was previously 

anywhere in the system, that missing drive is also entered installed in. The new channel and new bay are the channel 

into the power on change list at step 205. 55 and bay that the "unidentified'' drive is cunrenUy installed in. 

In next step 206, RAID controller 120 determines if each In next step 312, the F7 key is enabled. In the alternative, 

hard configured drive of the current configuration is in its selecting the F7 key causes the system to import the con- 
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figuration from the unidentified drives and to update the 
configuration controller 120. This choice is useful when the 
controller has been replaced, or when another system has 
failed and the disks from the failed system are moved to 
functional system. 

Unattended Mode Operation 

The user may configure the system at setup time such that, 
when one or more function keys are enabled, the system will 
select one of the enabled functions automatically, thereby 
removing the need to press a function key. This mode of 
operation is useful when the system is in a remote location 
and not readily accessible to the user. If more than one 
function key is enabled, the function having the correspond- 
ing highest numbered function key will be automatically 
selected. For example, if all function keys are enabled, the 
function corresponding to the F7 key will be selected. If only 
the F6, F5 and F4 keys are enabled, then the function 
corresponding to the F6 key will be selected. If only the F4 
key is enabled, however, no function will be automatically 
selected. Since the purpose of the F4 key is to allow the user 
to physically reconfigure the system, and physical recon- 
figuration can only be accomplished by the user, only the 
user can activate the function corresponding to the F4 key. 

Logical Drive Migration Algorithm 

In a system with a RAID adapter, it would be desirable to 
have logical drive migration (LDM) wherein data could be 
migrated (add/remove physical disk devices) or the existing 
RAID level of a logical drive (array) could be changed to 
provide better fault tolerance for the data. This traditionally 
requires backing up the data, defining a new logical drive, 
and then restoring the data. This invention, however, allows 
a user to change the RAID level of an existing logical drive, 
or to migrate logical drive data by simply issuing an LDM 
command to the adapter, which then provides a redefined 
logical drive configuration structure. 

Logical drive configuration structure information consists 
of data such as RAID level, state, number of chunks (areas 
of disk space), number of sectors of data available to a user, 
and details of each chunk specifying the disk's SCSI 
address, starting sector and number of sectors allocated to a 
logical drive. This invention also permits a user to migrate 
logical drive data belonging to one RAID level to another 
RAID level. The primary objective of this ftinction is to 
provide a mechanism for changing RAID levels. It also 
allows, with some restrictions, shrinkage and expansion of 
the logical drive size. 

The LDM algorithm imposes the following restrictions: 

1. If data between source and destination structures overlap, 
then the source stripe ordering must be preserved. 

2. Start Logical Block Address (LBA) of the first chunk 
(chunk 0) of the destination logical drive must not be 
greater than the start LBA of the first chunk of the source 
logical drive that is to be migrated. 

3. The stripe unit size of the source and destination logical 
drives must be the same. 

The logical drive migration algorithm examines the num- 
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different meaning depending on the direction(s) of migra- 
tion. For example, the start block for the BU direction 
defines the beginning LBA address of the last block, while 
start block for the TD direction defines the beginning LBA 
address of the first block of data. Only one start and one end 
block address are defined when the data migration is only in 
one direction, BU or TD, but two start and two end block 
addresses are defined for data migration in combination 
directions, BUITD or TD/BU. When migrating data, if the 
last stripe to be migrated is not on a full stripe boundary, then 
the last stripe is forced to be on a full stripe boundary, 
without corrupting data, by calculating adjustment to the 
blocks of the last stripe of data. 

This algorithm is illustrated in FIG. 4, wherein the defi- 
nitions of the various symbols are as follows: 
Rx: existing RAID level 
Ry: desired RAID level 
DIR: direction (TD, BU) 
TD: top down 
BU: bottom up 
SBl: start block address 1 
SB2: start block address 2 
EBl: end block address 1 
EB2: end block address 2 
NCS: number of chunks in source 
NCD: number of chunks in destination 
SUS: stripe unit size 
SN: stripe number 
EBS: end block source address 
EBD: end block destination address 

Managing Defective Media For RAID Adapter 

As described above, the RAID 5 algorithm spreads or 
"stripes" data across multiple physical drive. Each physical 
drive is made up of "stripe units" and these stripe units are 
combined across the physical drives to create a stripe of 
data. Each stripe of data also contains a parity stripe unit to 
allow reconstruction of data if a single drive failure occurs. 
Using this parity stripe unit, any stripe unit within the stripe 
can be reconstructed if a physical drive fails. A logical drive 
is made up of multiple stripes and a stripe is made up of 
multiple stripe units wherein each stripe unit is located on a 
unique physical drive. 

When a single physical drive goes defunct and stripe units 
of data cannot be read from that drive, the data can be 
reconstructed using the stripe units of the remaining physical 
drives. A stripe is reconstructed by reading all stripe units in 
a stripe except the failed stripe unit and doing an exclusive 
or operation on the data. In the case of a disk rebuild 
operation, this data is written to a new replacement device 
designated by the end user. When a logical drive rebuild is 
performed, each stripe unit is reconstructed until all stripes 
within that logical drive have been rebuilt. During RAID 5 
rebuild operations if media errors are not handled correctly, 
the user can receive incorrect data from the RAID sub- 
system. 

Media errors on a physical drive can occur that result in 
the device not being able to supply the requested data for a 
stripe unit. If a media error occurs during a logical drive 
rebtiild, then the data on the stripe cannot be reconstructed. 



bcr of source devices, the number of destination devices, the 60 For example, if a media error occurred while reading a stripe 



RAID level change requested and the associated stripe unit 
size to determine the start block, end block, and direction in 
which to migrate logical drive data. The algorithm decides 
whether Top Down (TD), Bottom UP (BU) or combinations 
of TD and BU are to be used as the direction in which data 
must be migrated to preserve data integrity. The definition of 
start block and end block, defining the LBA address, have a 



65 



unit on a first disk, then the corresponding (from the same 
stripe) stripe unit from the second disk could not be recon- 
stmcted. If this error is ignored, the next time the user 
accesses data in the stripe unit of the second disk, incorrect 
data will be supphed since it was not reconstmcted correctly. 

To avoid this problem, some RAID controllers report the 
device with the media errors as a dead device. When this 
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occurs, the entire logical drive will go off liae since the data 
can no longer be reconstructed (at this point there are 2 stripe 
units with unknown data). When a logical drive is off line, 
the user cannot access any data on that logical drive and all 
data on that logical drive is considered to have been lost. To 
bring the logical drive back online, the user must replace the 
device that has the media error, and restore the lost data. 

To overcome this problem, this invention uses a table 
located in the adapter's NVRAM to keep track of stripes that 
could not be reconstructed successfully. If a stripe could not 
be reconstructed successfully during a logical drive rebuild 
operation, an entry is made in a "bad stripe table" containing 
the logical drive and the stripe number that could not be 
rebuilt successfully. When the user tries to access data, this 
table is checked. If there is an entry in the bad stripe table 
for a stripe being accessed, the user will receive an error 
message. Although the user may lose a small portion of the 
data, the user is only presented with an error message instead 
of incorrect data. Also, if a user decides to reformat this 
portion of the disk, a disk formatting utility will also receive 
an error when accessing the portion of the disk in the bad 
stripe table. When this occurs, the formatting utility will 
reassign the failing area to another area on the logical drive. 
The bad stripe table can also be checked on write operations. 
If an entire stripe of data is written successfully and that 
stripe is found in the bad stripe table, the entry is removed. 
To write the entire stripe, sectors that have non-recoverable 
media errors need to be reassigned by the disk drive or Raid 
adapter. 
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This invention can be further enhanced to only keep track 
of the stripe unit that could not be rebuilt, or by determining 
which sector (a smaller portion of disk space than a stripe 
unit) is bad. When the bad sector is determined, only that 

5 sector is placed in the bad stripe table instead of the stripe 
unit or stripe number. The failing sector can be determined 
by doing single sector reads to the entire stripe unit or stripe 
number that is failing. If a sector read fails, then only that 
sector is placed in the bad stripe table. If a single sector write 

10 is done successfully to a sector that is contained in the bad 
stripe table, then the entry is removed from the table. To 
write a single sector, sectors that have non- recoverable 
media errors need to be reassigned by the disk drive or 
RAID Adapter. 

IS The bad stripe table should also be stored on the physical 
drives attached to the controller. Each time an entry is made 
in the NVRAM bad stripe table, the bad stripe table is copied 
to a reserved area in each physical drive. If the physical 
drives are moved to another controller, the bad stripe table 

20 is copied from the physical drives reserved area to the new 
controller's NVRAM bad stripe table so that this informa- 
tion is not lost. 

Another consideration is the layout of the bad stripe table. 
If there are a lot of entries in the bad stripe table, accessing 

25 the bad stripe table for each I/O command can slow down 
performance. The bad stripe table can be split into multiple 
tables for each logical or physical device. Also, each entry 
can be maintained in order and searched using a binary 
search algorithm. 



TABLE 1 



Adeline SCHNS 3 
// Comments with 

// ro/wi is read only write ignore field 

// i/w is read write field 

// ro is read only field 

// =====««„=»„«====^ — = 



typedef struct 
{ 

UCHAR uclnitiator, 
UCHAR ucParamctcrs; 



UCHAR ucMiscFlag; 
UCHAR ucStatc; 



ULONG ulBlockCount; 
UCHAR ucDcviccID[2fi]; 
} DEVSTTATE; 



// 36.bytc SCSI device state 

// Initiator - 1, Target - 0, ro/wi 

// Bits 0-4 device type: matches, ro/wi 

// Inquiry cmd: 0 - Disk, 1 = Up e, etc. 

// Bit 5 : 0 - 5 MHZ, 1 = Fast Scsi 

// Bit 6 : 0 » 8 bit, 1 = 16 bit 

// Bit 7 : 0 = no tag que, 1 o tag que 

// Bit 0 - PFA, ro/wi 

// Bit 1 = Stale data 

// 00 = EMP. 04 - DHS, 08 = DDD, 01 = SBY, r/w 

// 81H = RDY. 85H « HSP, 89H - ONL, 8BH « RBL 

// 05 = SHS 

// Bit 7 = Spin, 

// Bit 3 - Configured, 

// Bit 2 - HotSpare, 

// Bit 1 = Rebuild in progress, 

// Bit 0 = Present: true if device can 

// pass SCSI Tsst Unit Ready command. 

// Bit 7 3 2 1 0 State 

//SPN CFG HSP RBLTUR 

// 1 1 0 0 1 ONL 

// 1 0 1 0 1 HSP 

// 1 0 0 0 1 RDY 

// 1 1 0 1 1 RBL 

// 0 1 0 0 0 DDD 

// 0 0 1 0 0 DHS 

// 0 0 0 0 1 SBY 

// 0 0 1 0 1 SHS 

// 0 0 0 0 0 BMP 

// Tbtal # sectors not inc reserved, ro/wi 
// Inquiry product id & serial number, ro/wi 
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TABLE 1 -continued 



typcdcf struct 
{ 

UCHAR ucChn; 
UCHAR ucTgt; 
USHORT usMisc; 



ULONG ulStartScct; 

ULONG ulNoOfSccts, 
} CHUNK; 
typedef struct 
{ 

USHORT usUserField; 
UCHAR ucState; 



UCHAR ucRaidCacheParam; 



UCHAR ucNoOfChunkUnits; 
UCHAR ucStripeSize; 



UCHAR ucFaranis; 



UCHAR ucReaerved; 
ULONG ulLogDrvSizc; 
CHUNK chunl£(MAX_CHUNKS]; 
} LOGICALDRIVE; 



// Channel number: 0 - NCHNS-1, r/w 

// SCSI id R*-: 0 - MAX_TGT-1, r/w 

// set to zero except in CHUNKIO], CHUNK(1] and 

//CHUNK(2J 

// CHUNK[3] is in each logical drive is reserved for 
// clieat server admin. 

// In CHUNK[0] and CHUNK[1] it is as follows 

// bits 1 0:00 ch 0, lo id => bank 0 

// 01 ch 0, lo id => bank 1 

// 10 ch 0, lo id => bank 2 

// bits 3 2:00 ch 0, hi id => bank 0 

// 01 ch 0, hi id => bank 1 

// 10 ch 0, hi id -> bank 2 

// bits 5 4:00 ch 1, lo id bank 0 

// 01 ch 1, lo id => bank 1 

// 10 ch 1, lo id => bank2 

// bits 7 6:00 ch 1, hi id => bank 0 

// 01 ch 1, hi id o> bank 1 

// 10 ch 1, hi id => bank2 

// bits 9 8:00 ch 2, lo id => bank 0 

// 01 ch 2, lo id -> bank 1 

// 10 ch 2, lo id -> bank 2 

// bits b a:Oa ch2, hi id -> bank D 

// 01 ch 2, hi id -> bank 1 

// 10 ch 2, hi id -> bank 2 

// bits d c:00 ch 3, lo id -> bank 0 

// 01 ch 3, lo id -> bank 1 

// 10 ch 3, lo id -> bank 2 

// bits f e:00 ch 3, hi id -> bank 0 

// 01 ch 3. hi id -> bank 1 

// 10 ch 3, hi id -> bank 2 

//Adapter docs not process these bits. 

// it is user defined 
// Start sector for this logical drive, r/w 
// No. of sectors allocated to this log drv, r/w 



// User defined, r/w 

// Cunent status of logical drive = r/w 

// 03H - OKaY 

// 04H - CRiTical 

// 02H - OFfLine 

// OOh - FREe 

// 05H - LDM 

// 06H - SYS 

// 24H - CRS (Critical SYS) 
// 14H - CRM (Critical LDM) 

// bits 2-0 - Raid level (0,1,5) r/w 

// bit 3-6 - 0 

// bit 7 -1 -> Write-Back 

// 0 «> Write-Through 

// No. of chunk units in parity band 1 to 16, r/w 

// 0 - 3 reserved for future expansion, r/w 

// 4 " Sk stripe unit 

// 5 o 16K stripe unit 

// 6 - 32K stripe unit 

// 7 - 64K stripe unit 

// 8 + reserved for future 

// NOTE: RAID Adapter RF supports 8K to 64K stripe. 

// bit 0 - 0, reserved 

// bit 2 - 0 data scrubbing is enabled 

// 1 data scrubbing is disabled 

// bit 3 - cache bypass on read, ro 

// bit 4 - 0 copy of write data also in Posiedon, r/w 

// 1 dual copy disabled 

// bit 5 - 0 read- ahead enabled, r/w 

// 1 read- a head disabled 

// bit 6 - 0 Reserved 

// bits 1 and 7-0 reserved for future 

// used by the configurator 

// size of the logical drive, ro 

// details of each chunk unit 
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TABLE 1 -continued 



typcdcf struct 
{ 

UCHAR board_di5c(8]; 
UCHAR proocssoiiS]; 
UCHAR ucNoChanType; 



UCHAR ucNoHostlntType; 



UCHAR ucCompression; 
UCHAR ucNvramiype; 

ULONG ulNvramSize; 
} HARDWAREDISC; 
typcdcf struct 
{ 

UCHAR ucNoOOxjgDrivcs; 
UCHAR ucDatcD; 
UCHAR ucDatcM; 
UCHAR ucDatcY; 

UCHAR imt_id[ 4 I 



UCHAR host_id[ 12 ]; 

UCHAR time_sigii[ 8 ]; 
stmct 
{ 

UINT cfgdrv_updcnt 16; 
UINT startup_delay :4; 
UINT concur_drvstart :4; 
U[NT reserved :3; 
UINT stage :1; 
UINT cluster :1; 
UINT bios_compat :1; 
UINTT boot_cdr :1; 
UINT auto_rearrange :1; 
}UserOpl; 

USHORT user_field; 
UCHAR ucRebuildRate; 

UCHAR ucReserve; 
} CONFHDR; 
typedef struct 
{ 

CONFHDR 
HARDWAREDISC 
LOGICALDRIVE 



// board description in ascii. For RAID Adapter RF ro 

// this field is 'CPH RF3C* 

// local processor description in ascii. ro 

// For RAID Adapter RF this field is '403GC-33' 

// bits 0-3 - number of 1/0 (SCSI) channels, ro 

// bits 4-6 - channel type, ro 

// 000 parallel SCSI wide 

// note: RAID Adapter RF only 

// supports parallel wide SCSI 

// 001 parallel SCSI narrow 

// 010 serial SSA 

// on serial P1394 

// 100 serial FC-AL 

// bit 7 - 0 reserved for future 

// number and type of host interface, ro 

// bits 0-3 - number of host interface 

// bits 4-6 - interface type 

// 000 32 bit PCI 

// 001 64 bit PCI 

// 010 80 MB/sec Micro Channel 

// Oil 40 MB/sec Micro Channel 

// 101 EISA 

// bit 7 - 0 reserved for future 

// type of compression hardware, 0 for none, ro 

// NVRAM cache type, 0 for no cache nvram, ro 

// 1 for Posiedon, ro 

// size of nvram in sectors when present, ro 



// No. of logical drives: 1 - 8, r/w 

// date of configuration written, r/w 

// date of configuration written, r/w 

// date of configuration written, r/w 

// packed ddmmyy 

// Initiator Id 

// bits 0-^ 

// Speed bits 5-7 

// OOOx xxxx - lOMHZ 

// 00 Ix xxxx - 20MHZ 

// 01 Ox xxxx - 5MHZ 

// 11 Ix xxxx - OMHZ 

// Chassis # from host, r/w 

// Hme stamp from RAID Adapter RF clock, ro 



// count of cfgupdate since write cfg 
// delay (sec) between s pinup groups 
// # of drives to spinup in parallel 
// reserved 

// 1 = stage data during migration 

// 1 = adapter is part of a cluster 

// 1 = compatibility, 0 = extended 

// 1 = boot from CDROM, 0 = from drive 

// 1 = auto rearrange, 0 - just list 

// User defined 

// Rebuild Rate, r/w 

// Olh - low, 08h - mid, OFh - high 

// set to zero 



DEVSTATE 



} CONFIGURATION; 



confighcadcr; 
hardwarc_disc; 

logical_drive[ MAX_LOG_DRVS }; 
dev[ NCHNS ][ MAX_TGT + 1 ]; 



// configuration header 
// hardware description 
// describing each logical 
//drv 

// Info about device at 
// each adr 
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UCHAR ucRcpTblCnt; // Number of entries in 
Replaccmcnt'Tbl 

UCHAR ucDcvChgCnt; 

UCHAR tgt; //SCSI Ttirgct ID 

UCHAR chn; //SCSI Channel 

COhO^GURATION cfg; // Full copy of configuration struct 

DEVCHGLST DcviceChangeList[ MAX_DEV_CHO_LST ]; 

RTENTRY ReplacementTbl[ MAX_LOG_DRVS ]; 



We claim: 

1, A RAID system for use with removable and identifiable 
storage devices that can be attached to said RAID system at 
a plurality of different attachment points, said RAID system 
comprising: 

means for storing first configuration data, said first con- 
figuration data being indicative of a first system con- 
figuration in which a first plurality of removable stor- 
age devices are configured for attachment to said RAID 20 
system, said first configuration data including informa- 
tion regarding the identity, attachment point and the 
operational state of each one of said first plurality of 
removable storage devices; 

means for comparing the first configuration data stored in 25 
said RAID system to a second system configuration and 
for identifying any new storage devices that are a part 
of the second system configuration but are not a part of 
the first system configuration, for identifying any non- 
responding storage devices that are a part of the first 30 
system configuration but are not responding in the 
second system configuration, for identifying any repo- 
sitioned storage devices that were located at a one 
attachment point in the first system configuration and 
were repositioned to a different attachment pomt in the 35 
second system configuration, and for identifying any 
unidentified storage devices that are a part of the 
second system configuration but are not part of the first 
system configuration, and for recording the identities 
and attachment points of any such new, non- 40 
responding, repositioned, and unidentified storage 
devices in a change list; 

means for displaying the identities and attachment points 
of any storage devices that are listed in the change list; 

means for enabling an "accept change" key in response to 
the listing of a non-responding storage device in the 
change list, said accept change key, when activated by 
a user, causing the first configuration data to be modi- 
fied by changing the current state of any non- 
responding drive to a new state; 

means for enabling a "configuration adjustment" key in 
response to the listing of a re-positioned storage device 
in the change list, said configuration adjustment key, 
when activated by a user, causing the first configuration 
data to be modified to reflect any changes in the 
attachment points of any repositioned storage devices; 

means for enabling an "import configuration" key in 
response to the listing of an unidentified storage device 
in the change list, said import configuration key, when 
activated by a user, causing the first configuration data 
to be modified to correspond to the system configura- 
tion of an unidentified storage device; and 

means for enabling a "retry" key in response to the listing 
of a storage device in the change Ust, said retry key, 65 
when activated by a user, causing said means for 
comparing to re-compare the first configuration data 



stored in said RAID system to the second system 
configuration, thereby giving the user an opportunity to 
rearrange the configuration of the second system con- 
figuration to correspond to the first system configura- 
tion. 

2. A RAID system for use with removable and identifiable 
storage devices that can be attached to said RAID system at 
a plurality of different attachment points, said RAID system 
comprising: 

means for storing first configuration data, said first con- 
figuration data being indicative of a first system con- 
figuration in which a first plurality of removable stor- 
age devices are configured for attachment to said RAID 
system, said first configuration data including informa- 
tion regarding the identity, attachment point and the 
operational state of each one of said first plurality of 
removable storage devices; 

means for comparing the first configuration data stored in 
said RAID system to a second system configuration and 
for identifying any unidentified storage devices that arc 
a part of the second system configuration but are not 
part of the first system configuration, and for recording 
the identities and attachment points of any such uni- 
dentified storage devices in a change List; 

means for displaying the identities and attachment points 
of any storage devices that are listed in the change list; 
and 

means for enabling an "import configuration" key in 
response to the listing of an unidentified storage device 
in the change list, said import configuration key, when 
activated by a user, causing the first configuration data 
to be modified to correspond to the system configura- 
tion of an unidentified storage device. 

3. The RAID system of claim 2, wherein: 

said means for comparing further includes means for 
identifying any non-responding storage devices that are 
a part of the first system configiu-ation but are not 
responding in the second system configuration, and for 
recording the identities and attachment points of any 
such non-responding storage devices in the change list; 
and 

said RAID system further includes means for enabling an 
"accept change" key in response to the listing of a 
non-responding storage device in the change list, said 
accept change key, when activated by a user, causing 
the first configuration data to be modified by changing 
the current state of any nonresponding drive to a new 
state. 

4. A RAID system for use with removable and identifiable 
storage devices that can be attached to said RAID system at 
a plurality of different attachment points, said RAID system 
comprising: 

means for storing first configuration data, said first con- 
figuration data being indicative of a first system con- 
figuration in which a first plurality of removable stor- 
age devices are configured for attachment to said RAID 
system, said first configuration data including informa- 
tion regarding the identity, attachment point and the 
operational state of each one of said first plurality of 
removable storage devices; 

means for comparing the first configuration data stored in 
said RAID system to a second system configuration and 
for identifying any non-responding storage devices that 
are a part of the first system configuration but are not 
responding in the second system configuration, and for 
recording the identities and attachment points of any 
such nonresponding storage devices in a change list; 
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means for displaying the identities and attachment points 
of any storage devices that are listed in the change list; 
and 

means for enabling an "accept change" key in response to 
the listing of a non-responding storage device in the 5 
change list, said accept change key, when activated by 
a user, causing the first configuration data to be modi- 
fied by changing the current state of any non- 
responding drive to a new state. 
5. The RAID system of claim 4, wherein: lO 
said means for comparing further includes means for 
identifying any repositioned storage devices that were 
located at a one attachment point in the first system 
configuration and were repositioned to a different 
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attachment point in the second system configuration, 
and for recording the identities and attachment points 
of any such repositioned storage devices in the change 
list; and 

said RAID system further includes means for enabling a 
* 'configuration adjustment" key in response to the list- 
ing of a repositioned storage device in the change list, 
said configuration adjustment key, when activated by a 
user, causing the first configuration data to be modified 
to reflect any changes in the attachment points of any 
repositioned storage devices. 

« )|e DC i)e )|c 
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